UNIVERSITY  OF 

ILLINOIS  LIBRARY 

AT  URBANA-CHAMPAIGN 

BOOKSTACKS 


524 1   ^' 


Digitized  by  tine  Internet  Archive 

in  2011  with  funding  from 

University  of  Illinois  Urbana-Champaign 


http://www.archive.org/details/threealternative788leec 


4L0.  7^^ 


FACULTY  WORKING 
PAPER  xNO.  788 


Three  Alternative  Errors-in-Variabse  Estimation 
Methods:  A  Review  and  Extension 

Cheng-few  Lee 


Coiiec;e  of  Cornmerce  and  Business  A-jministraticn 
Bureau  c?  Sconomjc  ana  Business  Rssaarch 
University  of  'iiioofs,  Urbana-Charnpaign 


Lh\t: 


(Uf, 


BEBR 


FACULTY  WORKING  PAPER  NO.  788 
College  of  Commerce  and  Business  Administration 
University  of  Illinois  at  Urbana-Champaign 
June  1981 


Three  Alternative  Errors- in-Variable 
Estimation  Methods:   A  Review  and  Extension 


Cheng-few  Lee,  Professor 
Department  of  Finance 


R  Jl'.r.- J.  ■■  •  1' I  r^r-:    a 


Abstract 

Three  alternative  errors-in-variable  (EV)  estimation  methods,  i.e., 
classical  EV  method,  the  mathematical  programming  EV  method  and  the 
grouping  EV  method  are  reviewed  and  extended  to  multiple  regression 
cases. 


I .   Introduction 

In  discussing  the  statistical  law  of  demand,  Schultz  (19  25) 
pointed  out  that  the  traditional  ordinary  least  squares  (OLS)  can 
not  be  used  to  find  a  unique  linear  demand  curve  when  both 
explained  and  explanatory  variables  are  measured  with  errors . 
Frisch  (1943),  Koopmans  (1937)  and  Allen  (1939)  have  demonstrated 
that  a  priori  information  is  required  to  determine  the  true  regres- 
sion line  in  an  errors-in-variables  model.   Although  the  measure- 
ment error  problem  has  been  clearly  pointed  out  by  economists, 
econometricians  have  spent  most  of  their  efforts  on  the  specifica- 
tion analysis  instead  of  the  problems  associated  with  errors  of 
measurement. 

For  the  single  equation  model,  Wald  (1940)  has  developed  a 
bi-variate  grouping  method  to  determine  the  straight  line  without 
requiring  a  priori  information.   In  addition,  Reiersol  (1941)  has 
suggested  to  take  the  lagged  value  as  an  instrumental  variable 
when  the  true  component  instead  of  the  measurement  error  component 
of  the  observation  is  serially  correlated;  Tintner  (19  52)  has 
developed  the  variate  difference  method  for  the  errors-in-variables 
model;  Durbin  (19  54)  has  shown  that  the  rank  order  of  the  explana- 
tory variable  can  be  employed  as  a  useful  instrumental  variable. 

For  the  simultaneous  equation  model ,  the  main  contributions 
are  due  to  Anderson  and  Hurwicz  (1949),  Konijn  (1962),  Zellner 


(1970)  and  Goldberger  (1972) .   They  combined  the  a  priori  infor- 
mation required  to  identify  the  structural  parameters  and  the 
a  priori  information  required  to  identify  the  measurement  error 
variance  to  deal  with  problems  associated  with  measurement  errors. 

In  the  last  three  decades,  most  of  the  investigation  of  the 
errors-in-variables  model  has  assumed  knowledge  of  the  measurement 
error  variance.   In  the  latest  investigations,  attempts  have  been 
made  to  estimate  the  error  variance  itself.   Needless  to  say,  the 
relevance  of  these  empirical  results  should  be  carefully  evaluated 
before  drawing  misleading  conclusions  about  economic  theory. 

In  this  paper,  first  the  effects  of  measurement  error  on  the 
multiple  regression  coefficients  will  be  investigated  in  detail  to 
show  that  (i)  the  quality  of  economic  data  should  be  improved  to 
increase  the  usefulness  of  econometric  research  and  (ii)  the 
standard  estimation  method  should  be  modified  to  identify  the 
estimate  of  multiple  regression  coefficients.   Secondly,  the 
classical  method,  the  mathematical  programming  method  and  the 
grouping  method   will  be  extended  from  the  original  simple  regres- 
sion case  to  the  multiple  regression  case.   In  the  second  section, 
the  basic  multiple  regression  model  will  be  defined  and  the  effects 


The  mathematical  programming  method  refers  the  method  using 
iteration  procedure  to  estimate  the  parameters.   These  three 
different  methods  will  be  self  explained  in  sections  III,  IV  and 
V  respectively. 


of  measurement  error  will  be  investigated  in  detail.   In  the  third 
section,  the  bi-variante  classical  method  will  be  extended  to  the 
multivariate  case.   Both  the  mathematical  programming  method  and 
the  grouping  method  will  be  extended  from  the  vi-variante  case  to 
the  multivariate  in  section  IV  and  V  respectively.   Finally,  the 
results  will  be  compared  and  the  possible  further  research  in  this 
direction  will  be  indicated. 

II.   The  Effects  of  Measurement  Error  on  the  Multiple  Regression 

Coefficient. 

In  this  section,  first  a  basic  model  for  multiple  regression 
will  be  defined.   Secondly,  the  effects  of  measurement  error  in 
the  multiple  regression  coefficient  will  be  investigated  in  detail. 
Following  Moran  (19  71)  ,  a  trivariate  structural  relationship  can 
be  specified  as 


W^  =  a  +  6U^  +  yV^,       "  (1) 


where  W. ,  U.  and  V.  are  unobserved,  but  we  can  observe  Z  =  W.  +  t. , 
111  11 

X.  =  U.  +  e.  and  Y.  =  V.  +  n..   U.  and  V.  have  a  joint  normal  dis- 
111       1111       1 

tribution  with  variances  of  a^  and  a^    and  correlation  coefficient 
p^.   In  the  observed  variables  X,  Y  and  Z,  the  observed  errors  e, 
n  and  T  are  independent  normal  variables  with  zero  means  and 
variance  a^  ,    a'i    and  a^. 

In  addition,  we  also  assume  that  X,  Y  and  Z  have  a  multivariate 
normal  distribution  with  parameters  as  follows: 


(a)  m^  =  E(X) 

(b)  m^  =  E(Y) 

(c)  m-  =  a  +  3m,  +  ym- 

(e)  m^^  =   o^  +   a^  (2) 

(g)  m^Y  =  pa^a^ 

(^^  ^Z  =  ^U  ^  ^P^U^V 

(i)  111^2  =  ^P^u'^V  ^  ^4- 


The  joint  sufficient  statistics  of  m^ ,  m. ,  m, ,  m^y/  "Vy '  ^zz' 
iri^Y'  ^7'  ^7  ^^^  ^®  defined  as: 

(a)  X  =  n""^  E  X. 

1 

(b)  Y  =  n"""*  Z  Y. 

1 

(c)  Z  =  n""""  T.    Z. 

(d)  S^^  =  n"-^  Z  (X^  -  X) 

(e)  Syy  =  n'-^  Z  (Y^  -  Y)  (3) 

(f)  3^2  =  n'^  I  (Z^  -  Z) 

(g)  S^  =  n"^  Z  (X^  -  X)  (Y^  -  Y) 
(h)  Sy.^  =  n"^  Z  (X^  -  X)  (Z^  -  Z) 
(i)  3^2  =  n"-^  Z  (Y^  -  Y)  (Z^  -  Z)  . 

It  is  well  known  that  the  measurement  errors  will  bias  the 
estimate  of  multiple  regression  coefficient,  but  the  results  have 
not  been  carefully  investigated.   Here,  first  the  effects  of 


measurement  error  on  the  multiple  regression  coefficients  of 
classical  errors-in-variables  model  will  be  investigated.   Secondly, 
we  will  relax  the  classical  assumptions  by  allowing  constant 
measurement  errors,  dependence  among  the  true  values  and  measure- 
ment errors  and  dependence  among  measurement  errors .   We  will  call 
this  case  the  "new  classical  case"  to  distinguish  it  from  the 
"classical  case."   To  identify  the  regression  parameters,  without 
the  information  of  the  variance  of  measurement  error,  we  also  try 
to  impose  some  constraints  on  the  regression  coefficients.   This 
will  be  called  the  "constrained  classical  case."   The  results 
obtained  in  this  section  will  be  used  to  explain  why  both  mathe- 
matical programming  method  and  the  grouping  method  should  be 
extended  to  a  multiple  regression  analysis.   The  effects  of 
measurement  error  on  multiple  regression  coefficients  will  be 
investigated  in  accordance  with  the  classical  case,  new  classical 
case  and  constrained  classical  case  respectively. 

From  (2),  we  know  that  X,  Y,  Z,  S^^,  S^ ,    S^^,    S^^'  ^^2  ^'^ 
S    are  joint  sufficient  statistics  of  m,  ,  m_  ,  m.  ,  m  ^  >  ^Y '  ^72' 
m   ,  m^  and  niy^  .   If  the  former  nine  variables  are  jointly  inde- 
pendent, a  set  of  maximum  likelihood  equations  can  be  formulated 
as  follows . 

(b)  S^Y  =0^+0^ 

(c)  3^2  =  ^^'^U  ^  ^'^V   ^  23Yay^  +  al  (4) 
^^^   ^XY  =  ^UV 

^^^   ^XZ  =  ^  ^  Ya^v 


(1)  The  Classical  Case 

From  (4),  the  effects  of  measurement  errors  on  the  estimates 
of  3  and  y  in  the  classical  case  can  be  seen  from  the  following: 


Plim  ;  =   ^-V--2)-wy-  ^-WVW  ,,, 

plim  Y  =  •  (o) 

From  both  (5)  and  (6) ,  the  asymtotic  biases  of  3  and  y   can  be 
defined  as : 


plim  6-3  =  (7) 

(^^J^  -  4^  +  ^U^2  ^  4^1  ^  ^1^2 

plim  Y-Y  =  •  (8) 

The  direction  of  the  biases  of  3  and  y  can  be  treated  according 
to  the  following: 
Under  the  assumption  that  Cov  (UV)  =  0 

i)    a^  =  0,  o^   >    0 


(a)  plim  3-3  =   ^2^^^  ,  ^a^   =  0 


(b)   plim  Y~Y  = 


^U^^  "   ^2^ 


-yol 

(9) 


(9a)  implies  that  6  is  an  asymtotic  unbiased  estimator  of  B; 
(9b)  implies  that  y  is  downward  biased  estimator  of  y- 
ii)   If  aj  >  0,  a^  =  0 

(a)   plim  6-3  - 


4(^5  ^  4) 


^i^^w  -  ^4^ 


(b)   plim  Y-Y  =  ^^(^2  +  ^2) 


=  0.  (10) 


In  accordance  with  (9) ,  (10)  can  be  used  to  draw  some 
conclusions  about  the  biases  of  both  S  and  y- 


iii)  Finally,  if  a?^  r  0,  a^  >  0 


(a)   plim  6-3  =  - 


-u^"i 


(b)   plim  Y-Y  = .  (11) 

In  this  case,  the  biases  of  6  and  y  are  identical  to  those 
biases  of  the  coefficients,  which  are  obtained  from  regressing  Z 
on  X  and  Z  on  Y  respectively. 


Suppose  now  that  Gov  (UV)  7^  0 


i)    a^  =  0,  a|  >  0 


2 
(a)   plim  3-e  =  yh. 


^  oi,  ^   ajd  -  Rj^) 


(b)   plim  Y-Y  = /      _  (12) 

where  b   is  the  auxiliary  regression  coefficient  of  a  regressing 
V  on  U,  and  R?-      is  the  correlation  coefficient  between  U  and  V. 
(12a)  implies  that  the  direction  of  the  bias  of  3  depends  upon  the 
sign  of  both  y  and  b^. ;  (12b)  implies  that  y  is  a  downward  biased 
estimator  of  y  unless  (a^  -  b   +  a|)  is  smaller  than  zero. 

ii)   If  a^  >  0,  a^  =  0 


(a)   plim  3-6  = 


4  -  ^UV  ^  ^1 
(b)   plim  T-Y  =  h^     — -^ -_ (13) 


^i  -^  ^^u^i  -  ^V^ 


where  b   =  the  auxiliary  regression  coefficient  of  regressing  U 

on  V. 


iii)  If  a?^  >  0,  a^  >  0 


a 
(a)      plim  6-6   = 


U 


(b)       plim  Y~Y   =  22,22 

a„a,    +  a, a 


(14) 


2     u     ^  .2  ,  v^i  ■  "r2 


V 

From  (14) ,  we  can  see  that  the  direction  of  the  biases  of  both  6 
and  Y  are  ambiguous . 

An  example  may  be  useful  at  this  point.   Suppose  that  we  wish 
to  determine  whether  a  real  term  or  a  nominal  term  should  be 
employed  to  estimate  the  parameters  of  the  money  demand  function 
when  only  the  deflator  is  subject  to  measurement  error.   Suppose 
we  have  a  money  demand  function  defined  as 

M.     Y. 

—  =  ( — )  e  1  (15) 

P.     P. 
1      X 

where  M.  =  money  demand, 

P.  =  wholesale  price  index, 
X  ^ 

a   =  constant. 


w^  ,  (0,  0^)   , 

P.  is  unobserved,  but  we  can  observe  P'  =  P.e..   By  logarithm 
X  X     X  X     -^     ^ 

transformation,  an  observable  money  demand  can  be  written  as 
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log  M.  =  alog  Y^  +  (1-a) [log  P^  +  log  e ^]    +   w^        (16) 

If    (16)     follows    the   assumptions   of   classical   errors-in-variables 
model,    then   a   criteria    for   determining  whether   the   real    term  or 
the   norminal    term  is    to   be  employed   in   this   kind  of   money   demand 
function   can   be   derived  as    follows: 

i)    if  log  P.  is  ommitted,  the  specification  bias  of  the  estimate 
of  a  can  be  written  as : 

1-      "             /I       \    cov(log  P,    log  Y)  -,_. 

plim  a-a   =    (1-a)    ^ ^ ~ (17) 

var    (log  Y) 

ii)   if  log  P.  is  included,  the  asymptotic  bias  of  the  estimate 

of   a   is 

(1-a)    var    (log   z  . ) 

pliiri  a-a  =   — — — ■ (18) 

var (log  Y)    +   var (log   e.) 

From  (17)  and  (18) ,  it  is  clear  that  the  real  term  instead  of 
the  nominal  term  should  be  employed  to  estimate  the  parameter  of 
money  demand  function  if 


covdog   P,    log   Y) 


var  (log   e) ^^g^ 

var (log  Y)    +   var (log   e) 


var (log  Y) 

Similarly,  for  a  multiple  regression,  the  criteria  for  either 
including  or  excluding  price  index  in  the  regression  can  also  be 
drawn  by  specification  and  errors-in-variable  analysis. 

(2)   The  New  Classical  Case 

We  will  investigate  the  effects  of  measurement  error  on 
regression  coefficients  for  both  simple  and  multiple  regressions 
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when  the  classical  assumptions  are  dropped.   For  simple  regression, 
we  will  also  derive  the  effects  of  constant  measurement  error  on 
the  estimate  of  the  intercept  in  addition  to  investigating  the 
effects  of  random  measurement  error  on  the  estimate  of  the  slope; 
for  multiple  regression,  only  the  effects  of  random  measurement 
error  on  the  regression  coefficients  will  be  investigated. 

For  bi-variate  normal  case,  if  the  classical  assumptions  are 
dropped,  then  following  (1),  we  have: 

(a)  m  =  E(X)  =  E(U)  +  I 

(b)  m2  =  E(Y)  =  a  +  eE(U)  +  ^ 

(c)  m^  =  var(X)  =   a^    +   a^   +   2a^^ 

(d)  m^Y  =  var(Y)  =  Q^a^    +   o^    +    2Qa^^ 

(e)  m^Y  =  cov(XY)  =  ga^  +  a^^  +  a^^    +    o^  (20) 

where 

£  is  the  constant  measurement  error  of  U, 
n  is  the  constant  measurement  error  of  V, 
a^j    ,    c?      ,    o^     and   a        are    covariance    for   relevant   variables. 

From    (20) ,    we   can   obtain   the   asymptotic   biases    for   both    3    and   a   as 


+    a,,      +    a        -    2Ba„      -    So 


2 


(a)  plim   8-6   =   "^^    '    ""^    '       "" ^ ^  =    « 

(b)  plim   a-a   =    {e    -    Bri)    -    <t>[E{V)    +   ~]  .  (21) 
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The  differences  between  t±ie  asymptotic  biases  of  the  classical 
case  and  the  new  classical  case  have  following  differences. 

(i)   In  the  classical  case,  the  estimate  of  the  slope  will  be 
biased  toward  zero  only  when  the  independent  variable  is 
measured  with  error;  in  the  new  classical  case,  having  either 
independent  or  dependent  variables  measured  with  error  is  a 
sufficient  condition  to  bias  the  estimate  of  the  slope. 

(ii)  Under  the  classical  case,  measurement  errors  will  never  change 
the  sign  of  the  slope,  but  in  the  new  classical  case,  the  sign 
of  slope  can  be  changed  due  to  measurement  errors,  i.e., 
plim   ~    does  not  necessarily  lie  between  -1  and  0. 

(iii)The  intercept  has  a  higher  probability  of  being  affected  by 
measurement  errors  in  the  new  classical  case  than  in  the 
classical  case. 
For  multivariate  normal  case,  (5)  and  (6)  can  be  rewritten 

as  : 


plim  B  = —  \^'^i 

plim  Y  =   ""'  \^^) 
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If  only  V  is  measured  with  error,  then  the  asymptotic  bias  of  3 
can  be  written  as 


,  .   „  „       WU  2      U  2     Uri 
plim  6-3  = 


(^u4  -  ^w^  +  <^S'^2  -  ^un  ^  ^^h''v^ 


(24) 


Under  this  circ\imstance ,  if  the  variable  measured  with  error  is 
dropped  from  the  regression  analysis,  the  specification  bias  of 
3  is  yh  .  The  relative  magnitude  between  Yt>„,  and  (plim  3-3) 
can  be  analyzed  as  follows : 

2 

(i)       specifically,    if   Rtt     =0   and  b        =    0, 

then    (plim   3-6)    <    Yb^,  (25) 

(ii)  in  general,  (plim  6-3)  <  Yb  r^  if  and  only  if 

^''^     ''™         <  1.  (26, 


:i  -  R,^.    +  2b  .,)  +  -^ 2Y. 

TJn    nu       2 

°2 


Incidentally,  both  McCallum  (19  72)  and  Wickens  (19  72)  have 
drawn  some  conclusions  about  the  trade  off  between  the  asymptotic 
bias  and  specification  bias  of  3.   It  can  be  noted  that  their 
conclusions  are  only  a  special  case  of  (26) . 
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(3)   Constrained  Classical  Case 

If  we   impose    3   +   Y   =   1/    on   the   regression    coefficients   of    (4) 
and   for   simplicity,    assxime   only   that   V  is   measured  with   error,    then 
(12a)    should  be   rewritten   as 

oUl   -   6) 
plim  3-6   = .  (27) 

The  asymptotic  bias  of  this  case  can  be  analyzed  as  follows: 

(i)   If  3  >  0,  then  3  is  unbiased  when  6=1,  3  is  upward  biased 
when  6  <  1  and  6  is  downward  biased  when  6  >  1. 

(ii)  If  3  <  0,  then  6  is  always  upward  biased. 

III.   The  Extension  of  Classical  Estimation  Method  to  a  Multiple 
Regression  Analysis 

(1)   Classical  Method 

From  (4) ,  it  is  clear  that  if  p  =  0 ,  then  U  is  orthogonal  to 
V.   It  is  well-known  that  this  multiple  regression  reduces  to  two 
simple  regression  relationship.   If  p  7^  0  then  to  identify  3  and  Yr 
we  need  to  know  either  the  actual  values  of  a^ ,  a^  and  a^  or  the 
relative  ratios  among  o^  ,   a^   and  a^.      We  will  investigate  the 
following  cases : 

(i)   02  =    0,    a^   >    0,  a|  >  0,  and  a^   =    Xa^ 
(ii)  G^    >    0 ,    o^    >    0    and  o^    >    0 
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a)  a^    and  oi    are  known 


b)  a^   =   ^tCT^  and  a^  =  ^o^i' 


(Case  i)   Only  Z  and  X  are  measured  with  errors. 


Since  a^  =  0,  from  (4  c) ,  (4  e)  and  (4  f)  we  have 


(Sy2  -  3S^y) 
Y  =  c •  (28) 


Substituting  (28)  into  (4  e) ,  give  us 


a^  =  -X (29) 

3  ^^YY 


From  X  =  — ,  we  also  have 

'I 


"I  =  r'^xx  -  "u' •  "<» 


Substituting  (29) ,  (30)  and  (4  c)  and  (4  d)  into  (4  a)  and 
rearranging 

S   S  S^  S^  S   S 

-hx[S^-    ^X_YZ.^  +  f_(2      XY^+X(S_-  -^)]e+(S„  -  f^    ^^)  =  0. 
YY  YY  YY  YY 

(31) 
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The  solution  of  (31)  can  be  derived  as  follows 
Let 


XY  YZ 


(^xx-^)  -^(^zz-S^)  =^1 


Then  (31)  can  be  written  as 


k^B^  +  k^  3  +  k  =0  (32) 

2       1      O 


There  are  three  cases  to  consider: 
(a)   X  ^  0  when  a^    ■*■   0,  then  in  this  case  from  (28),  we  know  that 

2    _  "^XY^YZ  "^  ^XZ^YY  (33 J 


c   c    —  c;  2 
^XX^YY     XY 


(b)   \  ^  00  when  al^  ^  0 ,    in  this  case  from  (31)  ,  we  know  that 
:     ^ZZ^YY-  ^YZ    ^  (34) 


^XZ^YY    ^XY^YZ 


(33)  and  (34)  correspond  to  those  of  simple  regression  case, 
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(c)      When  both   a^    >    0    and   a^    >    0,    then 


k,    +      k,2    +    4Xk2 

e.    2    =      ^    -  ^ °.  (35) 

^'^  2Ak 

o 

Whether  S,  or  6^  should  be  chosen  still  remains  as  an  open  question. 
When  3  is  determined,  y  can  be  estimated  by  (28) .   After  both  6 
and  Y  are  estimated,  then  a  can  be  estimated  by 


a  =  Z  -  6X  -  yY.  (36) 

(Case  ii)   When  Z,  X,  and  Y  are  all  observed  with  errors 

(a)   both  a?  and  ai    are  known. 

By  (4  b)-(4  f ) ,  we  can  obtain  the  two  normal  equations  as  follows 


(^')  2^Z  =  ^^^XX    -    ^V     -^  ^^XY 

(b')  3^2  =  ^^XZ    ^    ^^^YY  "  ^2^-  ^^'^' 


Solving  (37)  by  Cramer's  rule  we  have 


:    _  ^XZ^YY  "  ^XY^YZ  "  ^XZ  ^2 

P   —  "  ~  "  ■  "A   "  ~  '  "  ■  "     A"     ■  ■       ■   A" 


^XX^YY  -  ^i^YY  -  '^2SyY  +  ^i'l    "  ^^XY^ 


^    _  ^YZ^XX  "  ^XZ^XY  "  ^YZ'^I ,,„, 

^XX^YY  "  "^l^YY  ~  ^2^XX  ^  *^1^2  "  ^^XY^  ^ 
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(b)   both  a^  =  ^-,0?  and  al    =  X_a^  are  known. 
From  (4  a) ,  (4  e)  and  (4  f ) ,  we  have 


From  (4  b)  and  (4  e)  we  have 

Substituting  (40)  into  (39)  and  rearranging,  we  have 

-   ^    ^^[^ZZ  "  ^2^XX^  "  ^'^XZ  "•  ^2^XZ^  (4^j 

^YZ  ^  ^2^XY 

From  (4  b) ,  (4c),  (4  e)  and  (4  f)  we  will  obtain 

B^s^^-BSyg  +  y^^^^YY~^l^XX^    "^  ^l^XZ^'^^^l^XY  "  °-      ^'^^^ 
Substituting  (41)  into  (42)  and  rearranging,  we  have 

(^'^XY-^^YZ^  (^SyZ^^2SxY^  '+[^T-32s^2  +  X2S^2^  [m+X^S^^] 


^^^YZ^^l^XY^-h^XY^^'^-^'^XZ^h^XZ^"    =  °'   (^^) 


where  T  =  S^^  -  ^2^XX 
M  =  S^Y  -  ^i^xx- 

After  rearranging,  (43)  will  become  the  following  cubic  equation 

/v  -«.  /v 

B^H^  +  6^H2  +  6H^  +  H^  =  0,  (44) 
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where 

\    =  ^"2^iY-^^2^XY^iz'-'^h^XY^^2^XZ^YZ^^2''^YZ^XZ 

"^1^2^XZ^XY'^^^1^XY^XZ"^1^^^XY 
^o  =  ^|SyzSJy+A^S^22^+^1^2SJzSyZ-^^H^2SxzSyZ-2^1^2TSxz2xY- 
When  ^2=0,  (44)  will  reduce  to  a  quadratic  equation  in  3. 

(2)   Constrained  Classical  Method 


Under  the  classical  case  (Case  ii) ,  if  we  only  know  a-  =  a,X, 
then  we  can  identify  3  and  y  by  imposing  B  +  Y  =  1. 
From  (4  b)  and  (4  c)  we  have 

"4  =  -^^U    -  ^YY  -^  ^^XX'  (45) 

also  from  (4  d)  ,  (4  e) ,  (4  f)  and  (45) ,  we  have 

3^2  =  (1  -  3)  i-Xa^   -  ^YY  +  ^S^^)    +    33^^  (46) 

^XZ  =  K  +  (1  =  ^^^XY-  (47) 

From  (45)  and  (47) ,  we  can  obtain  a  quadratic  equation  in  3 

?{{1-X)S^^-S^    H{-S^^-XS^^^2XS^^^S^^-XS^^)^XiS^^-S^^)    =    0. 

(48) 
When  A  =  0,  then  (48)  will  reduce  to 
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i.e.  ;   =    flL^iM  (49) 

^XY     YY 

Imposing  Q   +   y   =   1 ,    upon  a  multiple  regression  will  help  to 
identify  the  regression  coefficients,  but  it  should  also  be 
realized  that  the  constrained  regression  technique  will  bias  the 
estimates  of  the  regression  coefficients  if  the  unrestricted 
estimator  fails  to  satisfy  the  restriction  Q   +   y   =   1.      The  advan- 
tages and  disadvantages  of  the  constrained  regression  technique 
has  been  discussed  by  Theil  (19  71)  in  some  detail. 

IV.   The  Mathematical  Programming  Method 

The  approach  involves  the  estimation  of  the  parameters  of  a 
fTinction  conditional  on  the  maximum  likelihood  function  adjusted 
for  the  true  values.   This  method  is  different  from  the  classical 
method  as  follows : 

(i)     The  variances  of  measurement  errors  for  every  observation 

are  different, 
(ii)    A  weighted  regression  method  is  applied, 
(iii)   An  iteration  procedure  is  used  to  obtain  a  consistent 

estimator. 

It  can  be  proved  that  the  mathematical  programming  method  reduces 
to  the  classical  method  under  certain  conditions. 
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The  principal  contribution  I  make  here  is  to  extend  the 
bi-variate  mathematical  programming  method  (which  was  developed 
by  Deming  (1943),  York  (1966)  and  Clutton-Brock  (1967))  to  a 
tri-variate  case. 

Following  York  (19  66),  we  define  W(Z.),  W(X.),  and  W(Y.) 
which  are  the  weights  of  the  various  observations  of  Z.,  X.  and  Y.. 

It  is  assumed  W,  U,  and  V  are  functionally  rather  than  structurally 

2) 
related.     Also,  the  adjusted  values  of  Z.,  X.  and  Y.,  based  upon 

the  measurement  error  variances  of  every  observation  and  maximum 

likelihood  procedure  are  defined  as 


A  B-  +  A,B 

X,  =  ^-^ ^  (5o: 

A^B^  -  A^B^ 


A,B^  +  A^B_ 
y       =  J^ ^  (51) 

^1^2  -  ^2^1 


z^  =  a  +  6x^  +  YY^/  (52) 

(i=l,2,. . . ,N) 


where  A_  =  a^ .  +  3^a^ . 

2       Tl         £1 

^1  =  ^^'li 


A  =  a^ .X.  +  aBa^ .  -  Ba^  .Z  . 

O  Til         ni        £11 

B„  =  Qya^  . 

2  ni 


2) 

The  differences  between  the  functional  and  the  structural 

relationship  can  be  found  in  Grabill  (1961) ,  Kendall  and  Stuart 
(1961)  and  Moran  (1971)  . 
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1    Ti    '   ni 

B  =  a^ .Y.  +  a3a^ .  -  ya^  . Z . , 
o    Ti  1      ni     r|i  1 


The  mathematical  programming  procedure  begins  by  minimizing 
S  =  S^{W(X^) (x^-X^)2+  W(Y^) (y^-Y^)2+  W( Z^) ( z^-Z^) ^ }    (53) 
S.t.    z.  =  a  +  6x^  +  Yyj_.  _  (54) 


Totally  differentiating  (53)  and  (54) ,  we  have 


6S  =  E^{W(X^) (x^-X^)6x^  +  W(Y^) (y^-Y^)6y^ 

+  W(Z^) (z^-Z^)5z^}  (55) 


-5z.  +    5a  +   x.5b  +  B6x^  +  y^Sy  +  Y^y^  =  0,  (56) 


where  6  is  the  operator  of  total  differentiation.   Let 
(55)  +  X.  (56)  =  0,  we  now  have 

I^{6x^[W(X^) (x^-X^)+6X^]}+  Z^{6y^[W(Y^) (y^-Y^)  +  yX^] } 

+Z^{6z^[W(Z^) (z^-Z^)-X^]}+6aZ^X^+6BZ^X^x^+6YS^X^y^  =  0 

(57) 
Equating   the   coefficients    to    zero   gives 

-    X.6 

(^^      ^^i  -  ^i^   =  wTxTT 

-  x^e 
^^^      (^i  -  ^i^   =  W(T7T  (58) 

X. 
(c)        (z,    -    Z.)    =         ^ 


i  i'         W(Z.) 
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(a)  l.X.    X.  =0 

111 

(b)  Z.A.y.  =  0  (59) 

1  I-'  1 

(c)  Z.X.    =    0. 

11 


Substituting  (5  8)  into  (54)  yields 


A^  =  K^(a  +  6X^  +  yY^   -   Z^)  (60) 


where 


W(X^)W(Y^)W(Z^) 
^i  "  W(X^)W(Y^)  +  B"W(Z^)W(Y^)  +  Y^W(Z^)W(X^)  ^^^^ 

From  (59  c)  and  (60) ,  we  have 

E.K.(a  +  BX.  +  yY.  -  Z.)  =  0  (62) 


From  (59  a),  (59  b)  and  (60),  we  have 

Z  .K.  {aX.+6X?+YX.Y.-X.  Z.  }-Z.K?{fr7|-r    (a+BX . +yY  . -Z  .  )  ^  }    =    0. 
11         ii'ii      11         11   W(X.)  1     '     1      1 

(63) 
Z.K.{aY.+BX.Y.+YY^-Y.Z.}-SKj{^^^    (a+BX. +yY. -Z  .  )  M    =    0 

(64) 
From    (62) ,    we   have 


a   =    Z    -   bX   -    AY,  (65) 


I.X.K.  I.K.Y.  ZK.Z. 

u  rr  111^  111  •,•=■  11 

where    X   =      -.    ^ — ,    Y   =      „    ^ — ,  and    Z    =   —t;:^^ — , 

"i    i  "^i    i  "    i 
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If  we  let  P.  =Z.  -Z,Q.  =X.  -X  and  R.  =  Y.  -  Y,  and  substitute 
1111  11 

them  into  (60)  and  (61) ,  we  have 

ZK.CBQ?  +  YQ.R.  -  P.Q.}  -  2iK.{^^  (  gQ .  ^.^r  .  _P  . )  a  j  .  0. 

(66) 

ZK^{eQ.R.  +  YR?  -  RiPi)  -  2K2{^^(BQ^  +  YRi  -  Pi)M  =  0. 

(67) 
From  (66)  and  (67) ,  two  sixth  power  equations  in  6  and  y  can  be 
obtained.   These  equations  have  to  be  solved  numerically. 
If  Y  is  measured  without  errors,  then  (67)  reduces  to 

ZK.CSQ.R^  +  yR?  -  P^Ri^  =  0-  (68) 

From    (68)    and    (66),    we   can   obtain 

B^C^    +    26^C2    +    BC^   +   C^   =    0,  (69) 

K?  R.ZK.Q.P.    ^ 

where      C,   =    Z.    (Q .    -   ) 

■^  W(X.  )  ZK.R? 

1  11 

K?  ZK.Q.R.  IK.R.P. 

C-    =    Z.    —^=—    (Q.-R.  ^    ^    ^)  (P.    -    R.  ^  ) 

W(X. )  ZK.R?  ZK.R? 

1  11  11 

(Z.K.Q.R.)^  K?  R. ZK.R. P. 

C,    =    ZK.P? i-J^-J^ Z(— i— )  (    ^       ^    ^    ^    -    P.)^ 

^  ^    ^  Z.K.R?  W(X.)  ZK? 

Ill  1  1 

ZK.Q.R.ZK.P.Q. 
C      =         ^    ^    ^      ^    ^    ^    -    ZK.P.Q.. 

°  ZK.R? 

1    1 
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When  Y.  is  orthogonal  to  X.,  (58)  reduces  to  the  "Least  Square 
Ciibic"  of  York  (1966).   Under  some  special  assumptions,  e.g., 

a^ .  =  a^,  a^ .  =  a^  and  a^  .    =   o^  ,    the  solutions  of  6  in  (66)  can  be 

Ti    T   £1    e      ni    n 

shown  to  be  similar  to  what  we  have  derived  in  last  section. 

This  extension  will  reduce  to  Deming's  (19  43)  weighted  regres- 
sion results  when  the  quadratic  term  of  (66)  and  (67)  are  omitted. 
However,  my  result  is  more  general  than  Deming's  weighted  multiple 
regression  analysis. 

V.   A  Generalization  of  Wald ' s  Method  of  Fitting  Straight  Lines 

The  relationship  between  the  grouping  method  and  OLS  can  be 
demonstrated  by  applying  OLS  to  either  instrumental  variable  or 
grouped  data. 

Gibson  and  Jowett  (195  7)  and  Hooper  and  Theil  (195  8)  have  used 
the  concept  of  "Centre"  (by  taking  subgroup  means)  to  extend  Wald's 
method  of  fitting  straight  lines.   However,  Gibson  and  Jowett 's 
method  can  not  be  used  to  treat  a  model  when  independent  variables 
are  subject  to  observation  errors,  since  their  standard  errors  for 
regression  coefficients  are  obtained  under  the  assumption  that  the 
independent  variables  are  observed  without  errors.   The  Hooper  and 
Theil  approach  can  be  shown  to  reduce  to  Gibson  and  Jowett' s 
approach . 

Here,  a  new  approach  is  developed  which  is  based  upon  specifi- 
cation analysis.   This  method  is  useful  for  econometric  research, 
since  it  has  the  advantages  of  the  Gibson-Jowett  approach  but  not 
its  limitations.   The  main  purposes  of  this  sections  are:   (i)  to 
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derive   a   stepwise   grouping  method  which   can    treat  n-variate 
regression   problem  and    (ii)    to   derive   an   asymptotic   confidence 
interval    for   the   regression    coefficients.      In   effect,    I   generalize 
the   Wald   grouping  method  here    to    the  n-variate    case. 

Gibson   and   Jowett    (1957)    have    applied  Wald's   bi-variate    group- 
ing method   in   estimating   the   multiple   regression    coefficients   when 
the   independent   variables    are   not    subject   to   errors.      For   simplicity, 
and  without   loss    generality,    we   assiime    that    the   intercept  of    (1)    is 
zero,    then   Gibson   and  Jowett 's   method   can   be   described  as    follows: 

By  specification  analysis,  we  have 

(a)  b  =  6  +  b^Y 

(b)  r  =  y  +  b^3,  (70) 

where  b  =  (U'U)  ""'■U'W, 
r  =  (V'V)"-^V'W, 
b^  =  (U^U)"^U'V, 
by^  =  (V^V)"^'U.  (71) 

From  (70) ,  both  0  and  y  can  be  estimated  as 


b-rb 
(a)   6  =  ^ 


1-b   b 
vu  uv 


r-bb 

(b)   Y  =  ^^  (72) 

1-b   b 
vu  uv 
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Gibson   and   Jowett  einployed   a  bi-variate    grouping  method   instead  of 
OLS   to   estimate   b,    r,    b        and  b        to    simplify   the   calculation   proce- 
dure.     If  both   u   and   v  are   measured  without   error,    then  both   OLS 
and  Wald's    grouping  method   can   be   used   to   obtain   the   consistent 
estimators    for   b,    r,    b        and  b      ;    if   both   u   and   v  are   measured 
with   errors,    then  only  Wald's    grouping  method   can   be   used   to    the 
consistent  estimators    for    (71) . 

Now,   we  will   derive    an   alternative   method  when   independent 
variables    are   either  measured  with   or  without   errors. 

Case    (A)    -   When   independent   variables   are   not    siibject   to   errors. 
In   this    case,    by    substituting  w  =    z   -    x   into    (1)    we   have 


z.    =    Bu.    +   Yv.    +    T.  (73) 

1  1         '    1  1 


If  we  multiply  both  sides  of  (71)  by  an  idempotent  matrix 
M-  =  I  -  v(v^v)   v' ,  then  we  have 


M^Z  =  M^uS>   +   M2T,  (74) 


where 

(a)  M-Z  =  Z  -  v(v'v)~  v'Z  =  Z  -  vr 

(b)  M^uB  =  [u-v(v'v)"-'-v'u]6  =  (u-vb^^)e  (75) 


After   estimating    r   and   b        by   bi-variate    OLS    for    (72) ,    we    can 
^  uv 

substitute     (75)     into     (74)     to   obtain 

Z*   =    u*B    +    T*,  (76) 
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where 

Z*  =  z  -  vr, 

u*  =  u  -  vb   ,  and 
uv' 

T*  =  M2T. 

From  (76) ,  we  can  obtain  the  consistent  estimate  of  6  by  employing 
the  bi-variate  OLS . 

Similarly,  we  can  obtain 

Z*  =  v*Y  +  T*,  (77) 


where   Z*  =  z  -  ub. 


V*  =  V  -  ub   , 
vu 


T*  =  M^T    and 

M.  =  I  -  u(u'u)   u' 


From  (77) ,  we  can  obtain  the  consistent  estimate  of  y  by 
employing  the  bi-variate  OLS. 

Case  (B)  -  When  independent  variables  are  subject  to  error. 

If  both  u  and  v  are  also  measured  with  errors,  then  these 
observation  errors  will  affect  the  stepwise  OLS  method  in  four 
different  (but  separable)  directions;  i.e.,  they  affect  (i)  the 
estimate  of  b  and  r,  (ii)  the  estimate  of  b    and  b   ,  (iii)  the 
values  of  u  and  v  individually,  and  (iv)  the  estimates  of  3  and  Y' 
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If  b,  r,  b   ,  b   ,  S  and  y  are  estimated  by  Wald's  bi-variate 
uv   vu 

grouping  method  instead  of  OLS  method,  then  the  consistency 
property  for  these  estimators  can  be  preserved.   Now,  we  will 
prove  this  argument.   For  simplicity,  we  will  only  deal  with  the 
estimation  procedure  of  3.   Following  (75),  a  stepwise  grouping 
method  (SG)  can  be  employed  to  replace  the  stepwise  OLS  to  obtain 
the  consistent  estimator  of  3  in  accordance  with  following  three 
steps : 

(i)   Ordering  Y  in  ascending  order  and  deriving  Y  and  Z  into  two 
equal  groups ,  we  have 


^1  "  ^2 
r  =  


^1  -  ^2 


_  _       _  _      _i   rti      ri 
6(u,-u_)  +  Y  (v, -V  )  +  m   (  Z  x.-   Z  t  ^] 
i=l    i=m+l 

(v  -V  )  +  m~^(  En.-  Z  n.) 
■^   ^        i=l  ^  i=m+l  ^ 


(78] 


where  m  =  j  n. 

Since  the  variance  of  the  second  terms  in  the  numerator  and  denomi- 
nator are  0  (m   )  ,  then 

plim  r=Y+b   e=r.  (79) 

^  uv 

m-><» 
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(ii)   Ordering  Y  in  ascending  order  and  dividing  Y  and  X  into  two 
equal  groups ,  then  we  can  obtain 


b   =  — 

i^v    Y   -  Y 
^1     2 


^ iSitl .  (80) 

_  _      _,   m      n 

(v,-v-)  +  m   (  Z  n  ■-  Z   n  • ) 

^   ^        i=l  ^  i=m+l  ^ 


Since  the  variances  of  the  second  terms  in  the  numerator  and  denomi- 
nator are  0 (m   ) ,  then 


plim  b   =  b  (81) 

'^  uv    uv 

m-»-<=° 


Substituting  u,  v,  r  and  b   of  (81)  by  X,  Y,  r  and  b   , 
we  have 


(a)   Z*  =  Z  -  rY,  =  (W  -  vr)  +  (t  -  nr)  =  W*  +  t* 


(b)   X*  =  X  -  Yb^  =  (u  -  vb^^)  +  (£  -  nbj^Y^  =  u*  +  e*. 

(82) 

In  (72) ,  W*  and  u*  are  true  values  for  the  observable  values  Z* 
and  X*.   Following  (76),  the  true  relationship  between  W*  and  u* 
can  be  defined  as 

W*  =  u*B.  (83) 


To  estimate  Q,   we  should  substitute  Z*  and  X*  for  W*  and  u*. 

Now,  we  will  examine  whether  (81),  (82)  and  (83)  satisfy 
Wald's  five  assumptions. 


3i 


(a)  plim  (e'i'e^)  =  plim  (e.e.-e.ri-b   -e-n-b   +ri-ri-b^  )  =  0. 

/v  /s  /\ 

(b)  plim  (T^ffr)  =  plim  (x.T.-x.n-r  -  t-n-r  +  n-n-r^)  =  0. 

(c)  plim  (e'f'T'f')  =  plim  (e.x.  -  e  .  n  •  r  -  n  ■  t  .  +  r)?b   r)  =  a^b  r. 
^  '  ^      11    '^  11    11     11    1  uv      ri  uv 

(d)  (83)  is  a  single  linear  relationship. 

(e)  The  limit  inferior  of 


m 

n 
-EX* 
i=m+l    ^ 

m 

(n  =  2,  3, 


ad.  inf.)   (84) 


is  positive . 

Since  the  X*'s  (i=l,  ...,  n)  are  in  ascending  order. 


(iii)   Ordering  X*  in  ascending  order  and  dividing  X*  and  Z*  into 
two  equal  groups  respectively,  then  we  can  obtain 


6  = 


7*  _  7* 

21 _2_ 

X*  _  X* 
1     2 


3 (u*-u*)  +  m 


-1 


m 
(  E  Tf  - 
i=l  ^ 


n 

E 

i=m+l 


T?) 


-1 


m 


n 


(85) 


(u*-u*)  +  m   (  E  e*  -    E   e*) 
12         i=i  1    i=^+i  1 


Following  the  arguments  in  (81)  and  (79) ,  we  can  conclude  that 


plim  3=6 
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For  t±ie  two  independent  variables  case,  we  need  to  employ  a 

three  step  bi-variate  grouping  method  to  obtain  a  consistent  esti- 

k 
mator  for  each  regression  coefficient.   In  general,   Z   i  steps  are 

i=l 
required  to  estimate  each  regression  coefficient  for  a  k-indepen- 

dent-variables  multiple  regression. 

The  estimators  of  the  variance  of  e*  and  n*  can  be  derived 

as  follows: 

(i)   The  properties  of  the  relevant  statistics,  S^*,  ^t* '    ^w*' 


(a)   plim  (S^^)  =  a^^   +   a^, 


(b)  plim  (S^^)  =  0^^   +   of 

(c)  plim  S^*  =  6'aJ^ 


(d)  plim  S^*^*  =  BaJ^ 

(e)  plim  S^^x*  =  '^w*u*+  ^nV^ 


(86) 


;ii)   From  (86) ,  we  can  obtain  consistent  estimators  for  a|^  and 


a^+  as 

T* 


2 


^  S_j.,,j.   yb  u 

n^     -   rc;2   -  ^  .^     +   ^y  ^1 

^      ^      B       6 

where  S^^  =  sample  variance  of  Z*  observations, 
S^^  =  sample  variance  of  X*  observations, 
S^^„j.  =  sample  covariance  of  X*  and  Z*  observations. 
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The  Asymptotic  Confidence  Interval  for  6  can  be  derived  as 
follows : 

Following  Wald  (1940)  ,  we  define 


m            _  n               _ 

Z     (X*-X*) 2    +  L       (X*-X*) 2 

re-    ^2   _   i=l      ^      ^  i=m+l      "■     ^ 

n 


(88) 


m  n 


E    (Z*-Z*) 2    +        Z       (Z*-Z*) 2 
1      1  •    _ .  -.       1      2 


^    >  2    _   i=l  i=m+l 


(s:*)^  = 


n 


m  n 


^X*Z* 


Z     (Z*-Z*) (X*-X*)     +         Z        (Z*-Z*) (X*-X*) 
i=l       ^      ^         ^      ^  i=m+l      ^      ^         ^      ^ 

n 


m  n 

EX*  EX* 

T         1  -111 

,  TTj.        1=1  —J.        i=in+i 

where     X*   =  ,  x*  =  

•^  m  ^  m 


m  n 

E    Z*  E       Z* 

z*  =  i^^,  z*  =  i^2±i_i, 

m  m 


Under  the  probability  limit  sense,  following  Hogg  and  Graig 
(1969),  the  statistics  S^ . ,  S"^  and  S^^  ^  are  distributed  inde- 
pendent  of  X*   X*,  z"*  and  z"* 
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Now  we  shall  show  that 


..in  K  (B-B) (39) 


/N 


(S^*)^  +  BMSjJJ^  -  2BS2\x* 


is  asymtotically  normally  distributed  with  zero  near  and  xxnit 
variance,  where  K  =  X*  -  XJ,  then  from  (79),  we  have 


K(B-B)  =  (Z*  -  Z*)  -  BK 

=  [(Z*  -  W*)  -  (Z*  -  W*) ]  -  B(F*  -  I*) 

=  (T*  -  T*)  -    QU*   -   e"*)  .  (90) 


If  both  T*  and  e*  are  normally  distributed,  then  from  (88) 
and  (90)  ,  we  can  show  that 

,[n   K(6-6)  ~  AN(0,  a^)  ,  (91) 

where 

a2  =  (CT^   +  B^T^  .  -  26b  ra^) . 
T*      e*      uv  n 

similarly,  from  (84),  (85)  and  (86),  we  have 


(Z|  -  BX|)-(Z*  -  BX*)  =  (t|  -  T*)  -  6(£|  -£*) 

(i=l,  .  .  .  ,  m) 


and   (Z*  -  BX^)  -  (Z*  -  BX*)  =  (t^  -  x*)  -  B(e|  -  £^) •      (92) 

(i=m+l ,  , . . ,  n) . 
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From  (88) ,  we  also  have 


3(S^^)^  +  (S^*)'  -  2S^^2* 


p  m  _   _         n  _   _ 

)   Z  [(Z:f-6X^)-(Z*-6X*)  ]2  +    I       [(Z^-6Xt)-(Z*-6X*)  ]2 

n  X  i=l  ^ i=m+l   ^   ^    ^    ^ 

n-2  1  n  J 

(93) 
From  (92)  and  (93) ,  we  have 

plim  {(S^*)^  +  8'(Sj^*)'  -  2BSj^*2*^ 

■=  ^^*  +  e'a^^  -  2Bb^^ra2  (94) 


Following  (91)  and  (9  4) ,  we  have 
..jlT  K(8-B) -. 


AN(0,  1)  (95) 


'^'^x*  "^  ^z*  ~  2^^z*x* 


Denote  by  z   the  critical  value  of  standarized  normal  dis- 
-^   o 

tribution  to  a  chosen  probability  level,  then  the  deviation  of  ( 
from  an  assured  population  value  6  is  significant  if 


K(S-S)  ,-;n 


-.  S^l   +    S^S^^  -  26S2*^* 


>  z  (96) 

—  o 


(96)  implies  that 


A^B^  -  2A^S  +  A   =0,  (97) 
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where 


n 


n 


n 


Finally,  the  standard  error  of  6  can  be  obtained  by 


z^  (i=l,  2)  (98) 


where  Q'^   =   the  root  obtained  from  (97)  . 


V.   Conclusion 

The  classical  method  is  a  special  case  of  mathematical  pro- 
gramming methods;  the  grouping  method  is  a  special  case  of  the 
instrumental  variable  approach;  the  relationship  between  the 
grouping  method  and  the  classical  method  also  can  be  found  by 
applying  ordinary  least  squares  to  grouping  data.   Both  classical 
and  mathematical  programming  methods  can  indicate  the  magnitude 
and  direction  of  measurement  errors  affecting  the  estimates  of 
regression  parameters.   However,  the  grouping  method  is  more 
applicable  at  least  for  econometricians ,  since  it  requires  much 
less  information;  specifically,  it  does  not  require  knowledge  of 
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the  variances,  a  priori.   That  is  why  it  is  suggested  that  further 
works  in  the  errors -in- variables  model  concentrate  on  grouping 
method. 

The  effects  of  the  magnitude  of  the  ratio  of  measurement 
error  variance  on  locating  the  position  of  the  estimates  of  para- 
meters should  be  investigated  further;  it  may  also  be  useful  the 
finite  sample  properties  of  the  grouping  method  studied  by  using 
Monte  Carlo  methods  for  improving  the  efficiency  of  the  estimators, 
Durbin's  (19  54)  instrumental  variable  method  or  Bartlett's  (19  49) 
three  group  method  can  be  used  to  replace  Wald's  two  group  method. 
The  impact  of  measurement  errors  on  the  functional  form  should  be 
investigated  to  understand  the  interaction  effect  between  the 
specification  bias  and  the  measurement  errors .   The  methods  obtained 
in  this  paper  will  be  extended  to  estimate  the  parameters  of  a 
simultaneous  equation  system  in  the  further  research. 
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